Using Emulations to Enhance the Performance of Parallel Architectures

نویسندگان

  • Bojana Obrenic
  • Martin C. Herbordt
  • Arnold L. Rosenberg
  • Charles C. Weems
چکیده

ÐWe illustrate the potential of techniques and results from the theory of network emulations to enhance the performance of a parallel architecture. The vehicle for this demonstration is a suite of algorithms that endow an N-processor bit-serial processor array A with a ameta-instructiono GAUGE k, which (logically) reconfigures A into an N=k-processor virtual machine Bk that has: 1) a datapath and memory bus whose emulated width is k bits, as opposed to A's 1-bit width and 2) an instruction set that operates on k-bit words, in contrast to A's instruction set, which operates on 1-bit words. In order to stress the strength of the approach, we show (via pseudocode) how our emulation techniques can be implemented efficiently even if A operates in strict SIMD mode, with only single-bit masking capabilities and with no indexed memory accesses. We describe at an algorithmic level how to implement our techniqueÐincluding datapath conversion (acorner-turningo) and the creation of the word-parallel instruction setsÐon arrays of any regular network topology. We instantiate our technique in detail for arrays based on topologies with quite disparate characteristics: the hypercube, the de Bruijn network, and a genre of mesh with reconfigurable buses. Importantly, the emulations that underlie our technique do not alter the native machine's instruction set, hence allowing an invariant programming model across gauges. Index TermsÐParallel architecture, multiprocessor interconnection, parallel algorithms.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effect of Buildings around Highways on the Efficiency of T-Shaped Parallel Acoustic Barriers: A Theoretical Study

Introduction: Over recent years, various programs have been conducted to reduce noise, indicating the increasing concerns of society over noise pollution. This study has been performed to enhance the performance of parallel barriers on two sides of a highway considering the surrounding buildings. Material and Methods: The 2D boundary element simulation method was employed to predict the noise ...

متن کامل

Performance Analysis of Screening Unit in a Paper Plant Using Genetic Algorithm

This paper deals with the performance analysis of the screening unit in a paper plant using Genetic Algorithm. The screening unit in the paper plant has four main subsystems. These subsystems are arranged in series and parallel configurations. Considering exponential distribution for the probable failures and repairs, the mathematical formulation of the problem is done by Markov birth-death pro...

متن کامل

Ultra-Low-Energy DSP Processor Design for Many-Core Parallel Applications

Background and Objectives: Digital signal processors are widely used in energy constrained applications in which battery lifetime is a critical concern. Accordingly, designing ultra-low-energy processors is a major concern. In this work and in the first step, we propose a sub-threshold DSP processor. Methods: As our baseline architecture, we use a modified version of an existing ultra-low-power...

متن کامل

MATHEMATICAL MODELING AND PERFORMANCE OPTIMIZATION FOR THE DIGESTING SYSTEM OF A PAPER PLANT

This paper deals with the mathematical modeling and performance optimization for the Digesting system of a Paper Plant using Genetic Algorithm. The Digesting system of a Paper Plant has four main subsystems, arranged in series and parallel. Considering exponential distribution for the probable failures and repairs, the mathematical formulation of the problem is done using probabilistic approach...

متن کامل

Bit Swapping Linear Feedback Shift Register For Low Power Application Using 130nm Complementary Metal Oxide Semiconductor Technology (TECHNICAL NOTE)

Bit swapping linear feedback shift register (BS-LFSR) is employed in a conventional linear feedback shirt register (LFSR) to reduce its power dissipation and enhance its performance. In this paper, an enhanced BS-LFSR for low power application is proposed. To achieve low power dissipation, the proposed BS-LFSR introduced the stacking technique to reduce leakage current. In addition, three diffe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Trans. Parallel Distrib. Syst.

دوره 10  شماره 

صفحات  -

تاریخ انتشار 1999